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Abstract — The throughput benefits of random tinear network 
codes have been studied extensively for wirelined and wireless 
erasure networks. It is often assumed that all nodes within 
a network perform coding operations. In energy-constrained 
systems, however, coding subgraphs should be chosen to control 
the number of coding nodes while maintaining throughput. In 
this paper, we explore the strategic use of network coding in 
the wireless packet erasure relay channel according to both 
throughput and energy metrics. In the relay channel, a single 
source communicates to a single sink through the aid of a half- 
duplex relay. The fluid flow model is used to describe the case 
where both the source and the relay are coding, and Markov 
chain models are proposed to describe packet evolution if only 
the source or only the relay is coding. In addition to transmission 
energy, we take into account coding and reception energies. We 
show that coding at the relay alone while operating in a rateless 
fashion is neither throughput nor energy efficient. Given a set of 
system parameters, our analysis determines the optimal amount 
of time the relay should participate in the transmission, and 
where coding should be performed. 

Index Tenns — Random linear network coding, wireless relay 
channel, packet delivery energy 

I. Introduction 

Network coding, although initially introduced as a theoreti- 
cal tool in the field of network information theory (T], has been 
made practical by the use of random linear network codes 
(RLNC) 121, El, and has been shown to offer throughput, 
delay, energy and other advantages over classical store-and- 
forward strategies. To minimize the amount of centralized 
control, RLNC is often performed at the source as well as all 
intermediate nodes within a transmission subgraph. Because 
coding operations and data reception by intermediate nodes 
can have non-trivial energy costs, resource constrained net- 
works, such as wireless body area networks (WBANs), could 
potentially benefit from strategies that allow a reduction in 
the number of coding nodes, while maintaining the benefits of 
network coding. This paper studies the strategic use of network 
coding in a three-node wireless packet erasure relay channel, 
as illustrated by Figure [Ha), with an emphasis on whether 
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Fig. 1. Single relay unicast network, with corresponding flow hypergraph. 
Psr represents the packet transmission success probabilities between s and d. 

and where to code when the relay operates in half-duplex 
mode. We propose Markov chain models to characterize the 
system performance in terms of throughput and packet delivery 
energy, thus providing a way to find the optimal fraction of 
time for which the relay should participate in the transmission. 
We show, through numerical analysis, that coding at the 
relay alone while operating in a rateless fashion is neither 
throughput nor energy efficient, while coding at the source 
alone has performances close to the case where coding is 
performed at both nodes. The decision to code is based on 
packet erasure probabilities, transmission energy, as well as 
energy spent on reception and coded packet generation. 

Although seemingly simple, the analysis of the three-node 
network can offer insights to more complicated systems with 
more source or sink nodes. Two such examples are WBAN 
and advanced LTE cellular networks |4j|. In a WBAN, the 
topology is almost always star-shaped: data are uploaded in 
a converge-east sense to a central base station (BS) 15], ||6l. 
Depending on the relative location of a sensor on the human 
body, it may be useful in terms of energy efficiency to deploy 
a relay around the shoulder, in direct line of sight with both 
the front and back of the body. To the best of our knowledge, 
the throughput and energy tradeoffs in this case have not been 
studied before. On the other hand, in advanced LTE systems, 
data are transmitted in both directions, with an emphasis on 
download in a broadcast sense from a central BS to individual 
user equipments. Currently relaying is being considered as an 
improvement tool, for example, for coverage of high data rates, 
and for temporary network deployment |4|. Here the relay is to 
wirelessly connect to the radio-access network, and may either 
function as a smart repeater, or have control of its own cell. For 
both WBANs and advanced LTE systems, the introduction of 
network coding and the insertion of a relay may bring energy 
or throughput gains. As a starting point, we consider the three- 
node packet erasure relay channel, assuming physical layer 
designs are readily available on point-to-point links. Further 
extensions of this setup can involve additional source nodes, as 
in a WBAN, additional sink nodes, as in an LTE system, and 
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additional relay nodes, a scenario applicable to both examples. 

In evaluating system performance, we parametrically model 
the total energy consumption, taking into account transmission 
and reception energies as well as processing energy required 
to generate coded packets. The inclusion of energies for 
data reception, idle listening, and protocol overheads allows 
a more comprehensive understanding of the implications of 
inserting a relay and using network coding. Our goal is to 
characterize the throughput and energy performances of the 
system when a relay is added to a point-to-point link and nodes 
are network coding capable. Our previous work studied the 
energy advantages of network coding in half-duplex WBANs 
with a star-topology and showed that when reception energy 
is taken into account, total energy use could be reduced ||7|. 
The gains are expected to grow with the number of nodes. 

The remaining of the paper is organized as follows. Sec- 
tion In] summarizes some of the previous work related to 
the wireless relay channel and the application of network 
coding in such settings. Differently from previous works 
that focus on joint channel and network coding for optimal 
throughput analysis, in this paper, we assume a physical layer 
design is available, and network coding is inserted into the 
network layer, independently of the source and channel codes 
employed. Section |III] details the assumptions made and the 
system modeled. Section |IV] then discusses three separate 
cases, depending on the coding locations. For each case, we try 
to characterize the expected completion time and the expected 
completion energy of transmitting a given number of packets 
from the source to the destination. Through Markov chain 
analysis, we provide a framework for evaluating bounds on 
the system performance when coding is conducted at the relay 
only, and for determining the system performance when coding 
is conducted at the source only. Section [V] then establishes the 
key results through numerical evaluations, showing that, for 
a wide range of parameters, coding at the relay only is not 
throughput or energy efficient. Section [Vl] concludes the paper 
with discussions on future work. 

II. Related Work 

A relay channel models the problem where two nodes 
communicate through the help of one or more relays. This 
setup is common in multihop wireless networks such as sensor 
networks, where transmission power is limited, or in decen- 
tralized ad hoc networks, where nodes can only communicate 
with their immediate neighbors. Relays can overhear trans- 
missions to the destination, owing to the broadcast advantage 
of the wireless medium. Recently there has been a renewed 
interest in the classical relay channel [HI, ||9l, motivated by 
the potential to achieve cooperative diversity, and thus better 
capacity bounds ifTol - lfTsl . Schemes such as amplify-and- 
forward, decode-and-forward, and compress-and-forward have 
been proposed and studied extensively in terms of capac- 
ity, outage, energy efficiency, and optimal power allocation 
schemes lfT4l . ifTSl . Much of the analysis has focused on the 
fundamental performance limits at the physical layer and on 
the transmission of a single data packet. The introduction 
of network coding into the relay channel has also focused 



mostly on joint channel-network code design, with or without 
limited processing complexity constraints llT6ll - ll2l1 . In larger 
networks, however, network coding typically resides in higher 
layers of the protocol stack, independently of physical layer 
implementations . 

In this paper, we assume network coding takes place at 
the protocol layer, independently of source and channel codes 
employed. Such an assumption on the separation of channel 
and network codes may not necessarily be capacity achieving, 
but allows the introduction of network coding into existing 
systems. 

The use of RLNC in wireless erasure networks under packe- 
tized operations is first studied by Lun et al. 1221 . and extended 
to a scheduling framework by Traskov et al. 1231 . Other 
schemes that employ network coding in a relay setup includes 
the MORE protocol l24l . which performs RLNC at the source 
only to reduce the amount of coordination required by multiple 
relay nodes, and the COPE protocol, which employs RLNC at 
the relay only in a 2-way relay channel to improve reliability, 
taking advantage of opportunistic listening and coding l25]| . 
Fan et al. also proposed a network coding based cooperative 
multicast scheme to show that significant throughput gains can 
be achieved when network coding is performed at the relay 
only l26ll : one assumption in this work is that feedback is 
available from both the destination and the relay to the source 
after each packet reception. In practical systems, feedback can 
be costly in terms of both throughput and energy, depending 
on the underlying hardware architecture [Jl. 

In this paper, we explore rateless transmissions, where the 
acknowledgement for successful reception is sent only once by 
the destination when the transmission of all available data is 
completed. As described in the introduction, we also take into 
account the energy spent on reception and packet processing in 
addition to the energy required to transmit them. Furthermore, 
we assume that a sufficiently large field is used for network 
coding operations, such that transmissions of non-innovative 
packets from the source can be neglected. In terms of energy 
use, we make the simple assumption that coding energy stays 
constant as field size increases, and show that, the decision to 
code depends on the dominating energy term (transmission, 
reception, or code generation). The tradeoff between energy 
budget for the transmission of linearly dependent packets when 
field size is small and the energy budget for code generation 
when field size is large is discussed in l27l . 

III. System Model 
We represent the data flow through the relay channel using 
a hypergraph, as shown in Figure [Ha)- A hypergraph is a 
generalization of a graph: a broadcast link is represented by a 
hyperarc between a single start node and a set of end nodes, 
and a multiple access link is represented by a hyperarc between 
a set of start nodes and a single end node l22l . A wireless 
relay channel consists of a source node s, a relay node r, and 
a sink node d. Source s has n packets of the same length to 
transmit to d. It broadcasts to both r and d, while the relay 
r assists the transmission by either forwarding the original 
packet, or computing linear combinations of received packets 
before forwarding the ensuing mixtures. 
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We assume transmissions occur in a rateless fashion, with 
minimal feedback: s and r take turns to transmit, until d 
acknowledges that it has received enough degrees of freedom 
(dof) to recover the original n data packets. Here we use 
dofs to represent linearly independent packets. Such rateless 
operations are often desirable in systems where feedback can 
be costly in terms of energy or delay. 

Our model considers packetized operations, independently 
of the physical layer implementation of the system. As such, 
erroneous packets are dropped, and channel losses are mea- 
sured by a time-averaged erasure rate. This separation of 
channel and network coding follows from the assumption that 
physical layer designs are already available for the underlying 
point-to-point link, with a relay being inserted for performance 
improvements. The transmission success rates are assumed 
to be psr between s and r, prd between r and d, and Psd 
between s and d. Nodes operate in half-duplex mode, where 
a node cannot transmit and receive at the same time. To avoid 
interferences and collisions in a contention based scheme, 
we consider a time-division framework, where s and r share 
the use of the wireless medium. A genie scheduler allocates 
the wireless medium to the source a fraction of the total 
time, < a < 1, and allocates the wireless medium to 
the relay the remaining 1 — a fraction of time. One possible 
implementation of such a genie-aided scheduler is to share 
the same randomness at s and r. Figure [Tfb) illustrates the 
maximum flow on each possible link in this network model, 
computed directly from the transmission success rates and the 
time-sharing constant a. 

In terms of memory, let both s and d contain n units, but 
assume r contains x units only, where 1 < x < n. r uses 
its memory as a queue: arriving packets are stored; if r is 
already full, newly arrived packets are discarded. If r does 
not perform coding, it sends to d a packet from its memory 
directly and drops this packet from the queue; if r performs 
RLNC before forwarding, it sends to c? a linear combination of 
stored packets, where each is weighted by a random number 
chosen uniformly from a finite field ¥q. In this paper, ¥q is 
assumed to be sufficiently large, such that linear combinations 
thus generated are linearly independent from each other with 
high probability. Reference ||27| discusses the tradeoff between 
field size and energy use in more details. 

To evaluate the amount of energy spent to deliver success- 
fully a packet from s to d, we define four different energy 
terms. Etr represents the transmission energy per packet, 
where transmission occurs at either s or r. Erx represents 
the reception energy per packet at r. The relay therefore pays 
for being on and listening to the broadcast from the source. 
Enc represents energy for generating a coded packet; it should 
be a function of n, since the complexity of network coding 
operations depends on field size and generation size, which 
is the number of packets coded together. Nonetheless, in this 
paper, Enc is assumed to be constant, representing a maximum 
allowable value. Lastly, Eack represents the amount of energy 
spent by s to listen to the final acknowledgement from d. Note 
that all energy terms are defined relative to s or r. It is assumed 
that the destination d represents a base station without power 
or energy constraints. 



IV. Network coding in the Wireless Relay 
Channel 

In this section, three different cases are examined: RLNC 
at both r and s, RLNC at r alone, and RLNC at s alone. 
For the first case, a fluid flow model is used to analyze the 
achievable rate, packet delivery energy, and the ratio of these 
two metrics. For the latter cases, we propose Markov chain 
models to characterize the expected completion time and the 
expected completion energy of transmitting n packets from s 
to d. We also offer a brief discussion on the use of systematic 
codes, which will be studied in future works. 

A. Coding at Both the Source s and the Relay r 

When RLNC is performed at both s and r, we can use 
the fluid flow model by Lun et al. Il22l . Il28l to study the 
rateless transmission of network coded packets through the 
relay channel. A packet is considered innovative when it 
carries a new dof to a node. In the relay channel, s injects 
innovative packets into the hyperarc (s, {r, d}) in a fraction of 
the total transmission time, while r injects innovative packets 
into the arc (r, d) in 1 — a fraction of the total transmission 
time. Packet transmissions form innovative flows in this setup 
because both s and r perform RLNC over a large number of 
packets. Each mixture is an additional dof relative to d. The 
amount of innovative flow is limited by the packet erasure 
probabilities. Assuming flow conservation at r, the maximum 
achievable rate R from s to d can be derived by solving 
the following mathematical programming problem analytically 
using Fourier-Motzkin elimination ll23l . 

min c(i?, a) 

S.t. R < a{psr + Psd ~ PsrPsd) 

R < apsd + (1 - a)prd 
0<a<l. 

1) Packet-Level Capacity Bound: c{R,a) ~ 1/R. This 
case is equivalent to maximizing R, making the optimization 
linear. A closed-form solution can be found: 

• Case 1 : p^d < Prd, then 

^* _ PrdjPsr +Psd -PsdPsr) 
Prd + Psr{^ - Psd) 
Prd 

Prd +Psr(^ -Psd) 

• Case 2: psd > Prd, then the relay is not used, and 

R*=Psd, a* = l. (1) 

2) Packet-Delivery Energy: c{R, a) — [Etx + Enc + a{l — 
Ia=i)Erx]/ R- We define the packet delivery energy as the 
energy consumed per successfully transmitted packet and state 
it explicitly in terms of packet transmission, reception, and 
coding energies. Ia=i represents an indicator function that 
equals to 1 if a = 1, and otherwise. In other words, if 
a = 1, the relay is not used thus should not consume any 
energy. Because the scheme considered is rateless in the limit 
of infinitely large payloads, no energy is spent on listening to 
the acknowledgement at s. Also observe that if Erx = 0, this 
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problem reduces to case 1), and the optimal a* which achieves 
the highest throughput also leads to a minimal packet delivery 
energy. If Erx is non-zero, we can rewrite the optimization in 
terms of a alone and solve numerically. 

3) Packet-Delivery Energy per Throughput Rate: c{R,a) = 
[Etx + Enc + a(l - 'i-a=i)Erx\)/R^- The ratio of the two 
metrics above is the expected packet delivery energy per 
throughput rate, which evaluates changes in both energy and 
throughput. 

B. RLNC at the Relay r Only 

First assume that s is limited to sending the original packets 
only, while r performs RLNC over all packets it has received 
and stored in memory. Since transmission is rateless with 
minimal feedback, s is unaware of the knowledge at r or d; 
such knowledge refers to specific dofs, representing a subspace 
spanned by received packets. We assume that when allowed 
to transmit, s chooses one packet uniformly at random from 
the n available uncoded packets. 

One remark here is that more exhaustive or frequent feed- 
backs would make retransmissions from s more Ukely to 
be innovative. For example, if per-packet acknowledgement 
is available, s can selectively repeat those not successfully 
received by r or d. On the other hand, if d can acknowledge 
the exact number of dofs received, r can adjust the number 
of coded packets it sends to maximize throughput to d while 
minimizing its own energy use. We consider transmission with 
minimal feedback, assuming that the cost of feedback is high. 

We describe the states of the system by a three-tuple 
(m, fc, /): m is the number of unique dofs at d, k is the number 
of dofs shared by d and r, and I is the number of dofs at 
r only. Since s does not code and r stores only uncoded 
packets, m and I represent the numbers of distinct packets 
at r and d respectively; e.g., if n = 3, r has successfully 
received packets 1 and 2, while d has received packets 2 and 
3, then {m,k,l) = (1,1,1). Once a packet at r has been 
mixed into a coded packet, which in turn is received at d, 
it becomes part of the shared dof between r and d; e.g., if 
r has received packets 1 and 2, while d has packet 3 and a 
mixture of packets 1 and 2, then {m,k,l) = (1,1,1), since 
the mixture represents a shared dof between r and d, while 
r contains one dof that is innovative to d. With such state 
definitions, any three-tuple satisfying m + k + l < n is a valid 
state. Transmission initiates in state (0, 0, 0), and terminates 
in states {(m*, fc*, /*)| m* + k* ~ n}. 

For transmissions to be free of collisions, recall from the 
system model that we assume there is a genie-aided scheduler, 
such that s and r do not access the wireless medium at the 
same time. In each time slot, s transmits with probability a, 
while r transmits with probability 1 — a. By randomizing 
the transmitter at each time slot, the state transition process 
becomes memoryless, and the numbers of dof at each node can 
be tracked through a Markov chain. The memoryless property 
holds because the probability of the next transmitted packet 
being innovative relative to both or either of r and d can 
be expressed in terms of a and the current state (to, k, I), 
independently of past state evolutions. With probability a, s 




Fig. 2. Markov cliain model, with added terminating state St - The number of 
packets to send at s is n = 2. Any three-tuple (m, k, I) satisfying m+k+l < 
n is a valid state. 



chooses one packet uniformly at random from its n uncoded 
packets, and with probability {n — m — k — l)/n, this packet is 
innovative to both r and d. Similarly, with probability 1 — a, r 
computes a linear combination of the content of its memory, 
and sends the mixture to d. Here we assume r has enough 
memory to store all n distinct packets. 

An alternative to the genie-aided randomized transmissions 
is a collision-free, deterministic schedule, where s and r take 
turns to transmit for a fixed amount of time, determined by n, 
a, and the channel conditions. Without feedback, the average 
system throughput should be the same as the randomized case. 
However, with this deterministic schedule, since packets are 
uncoded at s, counting the numbers of dof at r and d requires 
knowledge of the exact packets present. Even with a small n, 
it is hard to track the evolution of packets in the system. 

FigurelUgives a sample Markov chain when n = 2. We have 
drawn the Markov chain as a tetrahedron, with the starting 
state on top, and the terminating states on the left vertical 
edge. State transitions occur after the transmission of a single 
packet, either from s or from r. In the case where r has not 
received any packet successfully but is chosen to transmit, we 
assume the slot is wasted. At state {m,k,l), the transition 
probabilities can be computed by considering all outcomes of 
the transmission, assuming independent packet losses. 

When m + k < ji, s broadcasts with probability a, while 
r transmits with probability 1 — a. Let the indicator function 
I;>o be 1 when I ^ 0, and otherwise, then 



Pi 



n — m — k — I 



{m,fe,/}->{m+l,fc,/} 



Psdil~Psr)a (2) 



— m — k ~ I 



{m,fe, /}->{)«, fc+l,Z} 



-PsdPsrCt 



(3) 



Pi 



n—m — k — l 



{m,k,l}^{m,k,l+l} 



{m,k,l]^Un-l.k+l.l} — — PsrOt 



Psr{l-Psd)a. (4) 

(5) 



PUn.k.l]^{m.k+l.l~l} = -Psdd + ll>OPrd{l ~ o) (6) 
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{m, A; , / } — ^ { m , A: , / } 



711 . \ ^ / \ ^ 

— (1 -Vsr) + -(1 -Vsd) + - 

n n n 



k-l 



(1 -psd)(l -Psr) 



+ (1 - Ij>oPr<i)(l - a) 



(7) 



When s broadcasts, this transmission may succeed in any 
of the three hyperarcs originating from s, as shown in Fig- 
ure [Tfb). Depending on which packets r and d already have, 
a successful transmission may or may not lead to a transition 
to a different state. For example, if the transmitted packet has 
already been received by both nodes, a state transition does 
not occur regardless of whether the transmission is successful. 
In particular, the following state transitions are possible. 
1) If d receives one more dof, while r does not, 

Vsd(X -Psr), this term 



P{m,k,l}^{m+l,k,l} — 

is included in Eq. (|2]i; 
2) if both d and 



n — rn~k~l 



{m,k,l}~^{in.k+l.l} 



r receive 

n — rn—k—l 



one more dof, 
PsdPsr, i-e., Eq. ©; 



3) if r receives one more dof, while d does not, 

P{m,k,l}^{m,k,l+l} = "~"^,,r*'~' Ps>-(l - Psd), i-e., 

Eq. ©; 

4) if the broadcasted packet has been previously received 
by d, and is now received at r, P{m.k.i}^{m-i,k+i,i} = 
—Psr\ note that if m = 0, this transition probability is 
o" i.e., Eq. Q; 

5) if the broadcasted packet has been received by 
r previously, and is now received at d, then 
P{m,k,i}~,{m.k+i.i^i} = iPsd; if I = 0, this transition 
probability is 0, i.e., Eq. (|6]i; 

6) a self transition occurs if the dof being sent has pre- 
viously been received by d but not r, if it has pre- 
viously been received by r but not d, if it is already 
shared, or if it has not been shared previously, yet 
it is not received successfully by r nor d during this 
transmission; correspondingly, P{m^k,i}^{m,k,i} is the 
sum of four terms; —(1 — Psr), -(1 — Psd), —, and 

i.e., Eq. ©. 

r transmits coded packets with probability 1 — a. Observe 
that, when r has received only a small number of packets, a 
mixture it generates may not be innovative with respect to d, 
because the dof could have been received by d itself already. 
For example, if n is equal to 3, and d has already received 
packets 2 and 3, then a coded packet from r containing the 
weighted sum of packets 2 and 3 is not innovative even if it is 
successfully received at d, but a coded packet containing the 
weighted sum of packets 1 and 2 is innovative. 

Again, explicitly tracking the contents of coded packets is a 
difficult task. Instead, we assume that all packets transmitted 
from r to c? are innovative. It is important to note that the 
computed expected completion time under this assumption 
is a lower bound on the actual expected completion time, 
and the corresponding throughput is an upper bound on the 
actual system throughput. When discussing numerical results 
in Section [V] we shall show that even this upper bound on 
throughput is not efficient compared with schemes where 
coding is performed at s, or at both s and r. Also observe 
that if n = 1, an uncoded packet is always transmitted; the 



innovative packet assumption is then always true, with the 
computed expected completion time being exact. With such 
assumptions, the following state transitions can occur 

1) if r has no unique dof to share, / = 0, 

P{7n,k,a}^{rn,k,o} = 1. i-e., Eq. 

2) if r has a unique dof to share, / > 0, and d receives 
successfully, P{m,k,i}^{m,k+i,i-i} ^Prd, i.e., Eq. dH); 

3) if r has a unique dof to share, I > 0, but d does not 
receive successfully, P{m,k,i}^{7n,k,i} = 1 - Prd, i-e., 
Eq. 0. 

The transmission process terminates when m + k = n, and 
P{rn,7i~m.o}^{m.n-m.o} = 1- In this Markov chain, all states 
are transient except the absorbing states. When n > 1, multiple 
recurrent classes exist. Since there is a single starting state, 
there exists a unique steady state distribution. To simplify 
the computation of the absorbing time, we append a virtual 
terminating state St, such that P{m,n-m.,o}^{ST} = 1' ^"'^ 
PiSt}^{St} — I- To compute explicitly the transition state 
matrix, we can index the states linearly starting from St to 
(0,0,0). Let St be state under this counting notation. In 
Figure [2] we give one possible set of linear indices, starting 
from the bottom to the top of the tetrahedron. 

Let Ti be the expected first passage time to state 0, i.e., the 
expected number of steps to reach state 0, starting in state i. 
Also let T be the system expected transmission completion 
time. Hence T ~ Tiq in this example. Since there are no 
cycles in this Markov chain and the expectation operation is 
linear, Ti can be solved recursively using the equation 




To = Q,i^O. 



Alternatively, if we express the expected first passage times 
to state in vector form f = [ Ti T2 ... Tio ] ^, then 
T = 1 + P\oT, where 1 is the vector of ones, and P\o is the 
submatrix of P with the first row and first column removed. 
Solving this linear system of equations gives 



T = (I - P^ 



(8) 



Since the original Markov chain does not contain cycles other 
than loops, the states have a topological order; P is a lower- 
triangular matrix. In addition, since self-transition probabilities 
are non-zero for all states except those next to St, I — P\q is 
strictly lower-triangular, thus invertible in the real field. 

To determine the expected amount of energy consumed by 
s and r, let Ei be the expected energy to be computed, staring 
from state i, and E be the expected transmission completion 
energy, i.e., E = Eio + Eack- Here Eack is included since a 
single acknowledgement is sent by r at the end to signal the 
end of transmission. A similar argument holds as in the deriva- 
tion of Eq. (O, and a system of linear equation can be solved to 
fi_nd E^[E^ E2 ... Ewf = (I-P\o)"Hl^^«..e) = 
TE^se, where = Etx + a{l - Ia=i)Erx + (1 - a)Enc. 

One last observation is that it is possible to constrain the 
amount of memory at r to less than n, and have r function 
as an accumulator, such that whenever an innovative packet 
is received, it is multiplied by new random coefficients and 
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added to each of the memory units. When given a transmission 
opportunity, r uniformly randomly chooses one mixture from 
its memory. The achievable rate region of the limited memory 
case should be outer-bounded by the full memory case. We 
shall show in Section |V] that even with full memory, coding 
at r alone is not efficient. 

C. RLNC at the Source Only 

When RLNC is performed at s only, the analysis of the 
system performance is similar to the previous case, where 
RLNC is performed at r only. First, every mixture sent by 
s is innovative with respect to r and d under the infinite 
field size assumption. Let state (m, fc, I) represent m unique 
mixtures at d, k mixtures shared by r and d, and / unique 
mixtures at r. Assume r acts as a queue with x finite units 
of memory, where 1 < x < n. Here r is allowed to have 
fewer than n units of memory, since it does not need to store 
distinct packets for explicit coding operations, r receives linear 
mixtures directly from s, functions as a queue, and drops any 
mixtures received after it is full. Any mixture transmitted from 
r is also dropped from the queue. A state (m, k, I) is valid as 
long as m + fc < n and k + l < x. Again, s transmits a fraction 
of the time. We assume that P{m,k.i}^{m'.k'.i'} is non-zero 
only if both (m, k, I) and (m', k', I') are valid states. Let the 
indicator function be 1 if the logic function /(•) is true, 
and otherwise. State transitions occur after the transmission 
of a single packet, either from s or from r. The state transition 
probabilities are as follows. 



P{m,k,l}^{m+l,k,l} = apsd{i -Psr) + apsdPsA+l=x 

(9) 
(10) 
(11) 



P{m,k,l}^{m,k+l,l} 
P> 



aPsdPsAk+Kx 
{m,k,l}~^{m,k,l + l} = "Psi-(1 ~ Psd)'^k+l<x 
P{m,k,l}~f{m,k,l} = "Psr(l - Psd)'ik+l=x 

+ a{l ~ psr){l - Psd) + (1 - a)Ifc+(=o (12) 
k . 



{mMd]^{m.+lM-l,l} 



(1 



Pi 



{■m,kd}^{m+l,k,l-l 



} = (!-«) 



Pi 



l + k 
I 

l + k 
I 



Lfc>0 



Prdh>0 



'{mM.l}^{m,kd-l} — (1 - Q:) , , (1 - Prd)il>0 



(13) 
(14) 
(15) 



l + k 

In the case where r has not received any packet successfully 
but is chosen to transmit, the slot is assumed to be wasted. 
Assuming independent packet losses, the state transition prob- 
abilities are computed as described below. 

First, when 7ti + fc < 7i, s broadcasts with probability a, 
and the following can occur. 

1) If c? receives the transmitted mixture, but r does not. 



P> 



{m,k,l}^{m + lMd} 



- Psd{i -Psr), i-e., Eq. ©; 



2) if both d and r receive the transmitted mixture, and k + 
I < X, P{m,k,i}^{ni,k+i,i} = PsdPsr, i.e., Eq. ([Toll; 

3) if both d and r receive the transmitted mixture, and k + 

I = X, P{m,k,l}->{m+l,k,l} = PsdPsr, 1-6., Eq. 

4) if r receives the transmitted mixture, but d does not, 
and k + l < x, the mixture is stored in memory. 
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Fig. 3. Markov chain model, coding at the source s only, with added 
terminating state St- The number of packets to send at s is n = 2; the 
amount of memory at the relay r is x = 2. 



5) if r receives the transmitted mixture, but d does not, and 
k + l ^ X, the mixture is dropped, P{m,k,i}^{m,k,i} = 
P.sr(l -Psd), i-e., Eq. O; 

6) if neither r nor d receives the packet, 

P{7n,k,l}-,{m,k,l} = (1 -P6r)(l - Psd), 1-6., Eq. 

r transmits coded packets with probability 1 — a, and the 
following state transitions can occur 

1) If r has no unique mixture to share, I = 0, and fc = 0, 



P> 



{ m , /c , / } ^ {m , A; , / } 



1, i.e., Eq. ( fT2b ; 



2) if r has no unique mixture to share, / = 0, and A: > 0, 



P> 



{m,k,l}^{m+l,k-l,l} 



P> 



{m,/c,i}— >{m,A;,i + l} 



Psr{l ~Psd), i.e., Eq. ([TTll; 



1, , i.e., Eq. ( fT3] i; observe that 
since a packet is dropped from r's memory after being 
sent, k decrements by 1 since the dof is no longer shared, 
while m increments by 1 since this dof becomes unique 
to d; 

3) if r has a unique mixture to share, I > 0, and 

• a unique mixture is sent, d receives successfully, 

P{7n,k,i}^{7n+i,k,i-i} = iTkPrd, i.e., Eq. (dill; 
« a unique mixture is sent, transmission is unsuc- 
cessful, P{,n,k,l}^{m,k,l-l} = 1^(1 - Prd), i.e., 

Eq. O; 

• fc > 0, a shared mixture is sent, d receives 

successfully, P^rn,k,l}^{m+l.k-l.l} 

Eq. (O; 

• fc > 0, a shared mixture is sent, transmission is un- 
successful, P{mMd}^{m+lM-l.l} = i:pfc(l ~Prd), 

i.e., Eq. ( flJl l. 

Transmission terminates when m + k = n, and 
P{m,n-m,o}^{m,n-mM} = 1- Again, a virtual terminating 
state St can be appended, such that P{m.n~md}^{ST} ^ 1' 
and P[St}^{St} = 1- With this addition, the Markov chain 
has one recurrent state only. Figure |3] gives a sample Markov 
chain when n = 2. The states can be indexed linearly starting 
from St as state 0, to (0,0,0) as the last state, which 
corresponds to state number 14 in this example. 

Our goal is to find the value of a that minimizes either 
the expected completion time T = T14, or the expected 
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completion energy E = E14. Unlike the coding at r only 
case, the Markov chain now contains cyclic paths in addition 
to loops. The expected first passage time starting from different 
states is r = (I — P^q)"^!. Here, the invertibility of I — P\q 
is guaranteed because P\q has entries less than 1 on the main 
diagonal, and I — P\q is a lower Hessenberg matrix with non- 
zero entries on the main diagonal |r29i|. Once the value of a 
that minimizes T is found, we can compute the associated 
E, where E = TE^se + Eack- The only difference is the 
value of Euse, since coding is now performed at the source s: 

Euse = Etx + a{l - Ia=i)Erx + (1 - a)Enc- 

D. Use of Systematic Codes 

Systematic network codes are an attractive alternative to 
non-systematic random linear network codes, since they of- 
ten reduce computation complexity and energy use, while 
maintaining the innovation of independent flows |[30| . With 
a systematic code at s only, s can first broadcast the uncoded 
packets one by one in order, then compute random linear 
mixtures for all remaining packets transmitted from s. r per- 
forms the store-and-forward function always. In a sufficiently 
large finite field, since every packet sent by s is innovative 
with respect to r and d, if we view each uncoded packet 
as an innovative mixture, the state evolution under this setup 
is the same as the case where full RLNC is performed at s 
only. If systematic coding is performed at s and RLNC is 
performed at r, the system gives the same performance as 
the case where RLNC is performed at both nodes. Another 
possibiUty is to perform systematic coding at both s and r. s 
first broadcasts uncoded packets one by one in order. It then 
computes a random linear mixture of all n packets whenever 
a transmission opportunity becomes available, r acts as a size 
n queue. When the relay has the opportunity to transmit, 
it examines the next packet in the queue. If this packet is 
uncoded, r transmits the uncoded packet directly. If this packet 
is coded, r linearly combines all data it has in memory before 
sending out the mixture to d. The system performance under 
this setup should be upper-bounded by the full coding case, 
and lower-bounded by the coding at s only case. The analysis 
of this additional systematic phase is non-trivial, so we leave 
its description and discussion to a later time. 

V. Numerical Results 

This section compares the performance of the three schemes 
discussed in Section |IV] under different channel conditions. 
We first consider the coding at r only and coding at s only 
cases and examine the expected transmission completion times 
per data packet. We then compare the three cases in terms 
of achievable throughput, computed as the inverse of average 
completion time, and packet delivery energy. 

A. RLNC at the Relay r Only 

Figure in plots the expected completion time per transmitted 
data packet as a function of a for different values of n, when 
Psd = 0.5, Psr — 0.8, and -prd ~ 0.8. Recall that n represents 
the number of data packets to be transmitted by the source to 



the destination. The optimal a* value that achieves the lowest 
T* jn is indicated by a large dot on each curve. 

In this figure, when a = 1, r listens but does not transmit. If 
77 = 1, the expected number of transmissions per data packet 
is 2. This is the solution to the ARQ scheme when = 0.5, 
where each packet is retransmitted until successfully received 
at d. When ri = 2, the expected number of transmissions per 
data packet is 3. Observe that, since r is unused and s does 
not code, s simply retransmits one of the two uncoded data 
packets each round, until both are received at d. This scenario 
is similar to the coupon collector's problem with 2 coupons, 
except packet erasures need to be taken into account. When 2 
coupons are to be collected, the expected number of trials until 
success is 2 X (1 + i) = 3. If divided by -psd and normalized 
by the number of packets, this solution leads to the value of 
3, the value on the curve 77, = 2, at a = 1, in Figure |4l 

Another observation from this figure is that, as n increases, 
the expected completion time T jn increases as well. This 
increase comes from transmissions by s. Since s randomly 
chooses one from n packets to transmit, a packet to be 
transmitted would have been received by r or d already 
with non-zero probability. This effect is especially significant 
towards the end of the transmission, when d has collected 
most of the dofs. In addition, the optimal a values, which 
correspond to the horizontal coordinates of the large dots, first 
decrease in value as n goes from 1 to 5, then increase in 
value as n increases to 20. This effect indicates that a tradeoff 
exists between the use of the relay and the amount of wasted 
retransmissions by the source. 

Although not explicitly shown here, we can plot and com- 
pare the expected completion time per data packet when the 
channel between s and d varies. It can be observed that when 
■psd increases, s is used a larger fraction of the time, with a 
becoming 1 if -pad is larger than p^d, similar to the coding at 
both nodes case discussed in Section HV-AI 

From the above numerical evaluations, we can conclude that 
RLNC at the relay r only while operating in a rateless fashion 
is not an efficient transmission scheme in terms of throughput. 
Figure m shows that using ARQ without coding (77 = 1, a = 
1) achieves the best expected completion time, or the best 
throughput. However, one issue with the 72 = 1 case is that 
each data packet, when transmitted successfully, requires an 
acknowledgement from d, i.e., Ejn ~ TEuse + Eack- Such 
frequent feedbacks are not energy efficient. If 77 > 1, even 
though the effect of the Eack term is mitigated by amortization 
over a larger n, the large increase in the value of T/77 shown 
in Figure 2] indicates that coding at the relay only is the most 
energy efficient when 77 = 2. In Section [V-CI we shall compare 
the energy use of this particular case with other schemes. 

B. RLNC at the Source s Only 

Figure |5] plots the expected completion time per data packet 
as a function of a, when n and x vary. 77, is the number of 
data packets to be transmitted by s, and x is the amount of 
memory available at r to store received mixtures. Unlike the 
coding at r case, here T /n decreases as n becomes larger, 
because each packet sent by s is innovative relative to r and 
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Fig. 4. Coding at the relay r only, expected completion time per packet Fig. 6. Coding at the source s only, expected completion time per packet 
T/n vs. a, as n changes in value; Ps^; = 0.5, psr = 0.8, p^d = 0.8. The T/n vs. a, as psr and p^d change; p^d = 0.5, n = 10, x = n. 
optimal T* /n is labeled with a large dot on each curve. 
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Fig. 5. Coding at the source s only, expected completion time per packet 
T/n vs. a, as n and x changes in value; = 0.25, psr = 0.8, p^d = 0.8. 
The optimal T* /n is labeled with a large dot on each curve. 

d, and as more packets are combined, the probability that a 
mixture sent by r is innovative becomes larger In addition to 
reducing T/n, another advantage of coding n packets together 
at s is that the cost for feedback can be amortized over a large 
number of data packets. Also observe from this figure that as 
low as a; = 3 units of memory suffices to achieve the expected 
completion time of the full memory case (i.e., x = n). 

Figure |6] plots the expected completion time per data packet 
as a function of a, for different psd, Psr, and p^d- Comparison 
among curves (1), (4) and (5) show that r should be given 
more time to transmit when the tandem link from s to d 
through r is more reliable than the direct link between s 
and d. Comparison between (3) and (4), however, show that 
r should not be used if the channel between r and d sees 
large packet losses, even if the channel between s and r is 
relatively reliable. This observation echoes the decision of not 
using the relay in the full coding case, as given by Eq. ([T]), 
and discussed in Section HV-AI Moreover, comparison among 
curves (2), (3), and (4) show that the optimal value of a is a 
function of channel conditions. 



C. Comparisons 

Figure [T] compares the maximum achievable rates of three 
cases; coding at r only as discussed in Section lTV-BI coding at 
s only as discussed in Section lTV-CI and coding at both s and r 
as discussed in Section ITV-AI Figure [8] plots the corresponding 
a* values that achieve these rates. For the coding at r and 
coding at s cases, the metric being plotted is the inverse of 
the optimal expected transmission completion time per data 
packet (T*/n). This inverse corresponds to the throughput R* 
of the systems under discussion. For the case where coding 
is performed at both s and r, the achievable rate is computed 
using Equations (1) and (2). 

When RLNC is performed at r only, as previous discussions 
have suggested, it is more desirable to mix fewer number 
of packets; since packets retransmitted from s are uncoded, 
a larger fraction of the repetitions are wasted. In Figure |7] 
the achievable rates are given for two different values of n. 
When n = 1, coding is not performed, hence the transmission 
degenerates into a routing scheme: s and r retransmit a 
single packet until an acknowledgement is received from d. 
Observe from Figure [8] that when the channel between s and 
d is poor (e.g. psd = 0.2), the route through r is preferred 
{a* ^ 0.65), otherwise r is not used {a* = 1). When 
n — 2, r still performs network coding, but only as the sum 
of two packets. Recall the assumption that all mixed packets 
transmitted by r are innovative relative to d; the second curve 
('?■, 77, = 2,x = 2') in Figure |7] is therefore an upper bound 
on the actual system throughput, reconfirming that coding at 
r only is not throughput efficient. 

When RLNC is performed at s only. Figure [7] shows that 
more than 69% of the rate attained by the coding at both 
nodes scheme can be achieved. Here the achievable rates are 
plotted for only one set of channel realizations, with psr = 0.8 
and prd = 0.8. The exact amount of coding gain depends 
on the reliability of all three links in the relay channel. Also 
observe that the performance gap decreases as the channel 
between s and d becomes more reliable. Moreover, Figure [8] 
shows that, when coding at s only, transmissions from r 
are not required after psd becomes reasonably good (e.g.. 
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Fig. 7. Achievable thu'oughput as a function of Ps^j, R* = Tjrjy^; Psr = 0.8, 
Prd = 0.8. 
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Fig. 8. Optimal a* con'esponding to throughput values in Figure [T] Psr = 
0.8, Prd = 0.8. 



Psd > 0.442). This is because transmissions from r follow 
a randomized scheme, leading to redundant repetitions that do 
not contribute additional dof to d. 

In maximizing throughput, coding as much as possible while 
fully using the relay r seems to be the optimal strategy, 
followed by coding at s alone. However, assuming that both 
coding and listening costs power, such approaches may pay 
higher costs in terms of energy. As discussed in Section |IV] 
our analysis enables the derivation of total energy costs. For 
example, if Etx, E^x, Enc, Back are identically 1, Figure |9] 
plots the packet deliver energy corresponding to the optimal 
a* in Figure |8] while Figure [TO] plots this energy consumption 
scaled by the maximum achievable rate. The different energy 
terms have been chosen assuming that coding and listening 
consumes energy on the same scale as transmission. Such as- 
sumptions are valid in systems where just having the circuitry 
turned on constitutes the most significant portion of energy 
use. Other ranges of values are also possible, as we have 
discussed in tlj, depending on the underlying physical layer 
hardware implementations. 



11 




1 - 

gl , , , , , , , , 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Psd 



Fig. 9. Packet delivery energy E/n as a function of p^d, coiTesponding to 
the optimal a* in Figure IT] psr = 0.8, p^d = 0.8, Etx = 1, Erx = 1, 
Enc = 1, Ecck = 1- 
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Fig. 10. Packet delivery energy per thi'oughput rate E/{nR) as a function 
of Psd, coiTesponding to the optimal a* in Figure|71 psr = 0.8, Prd = 0.8, 

Etx = 1, Erx = 1, Enc = 1, Eack = 1- 

It is easy to see from these figures that when is low, 
coding at both s and r is the most throughput and energy 
efficient, while coding at s alone provides a compromise 
between throughput and energy use; under better channel 
conditions, however, not coding {n = 1) and not using the 
relay {a = 1) require energy, while achieving equally good 
throughputs. At psd = 1, the energy cost for the successful 
delivery of one data packet is 2 when coding is conducted at 
both s and r: one on transmission, and one on coding. On 
the other hand, the energy cost for coding at s only, assuming 
n = 10, is 2.1: one for transmission, one for coding, and 1/10 
for listening to transmission termination acknowledgement. 
Moreover, the energy cost for coding at r only, assuming 
71 = 2, is 2: according the coupon collector's problem, on 
average 3 units of energy are spent on transmitting the 2 
packets, and one unit of energy is spent on receiving the 
acknowledgement. Lastly with simple ARQ (n = 1), two units 
of energy are spent on each successfully delivered packet. 
Under the same channel conditions and system parameters 
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Fig. 11. Optimal a* corresponding to packet delivery energy values in 
FigureEl psr = 0.8, Prd = 0.8, Etx = 1, Er^ = 1, = 1, E^ck = 1- 
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Fig. 12. Minimum packet packet delivery energy E* /n as a function of 

Psd, Psr = 0.8, Prd = 0.8, Etx = 1, Erx = 1, E„c = 1, E^ck = 1- 



as given in Figure |9] optimizing for energy use leads to 
a very different set of a values, plotted in Figure [TT] The 
corresponding optimal packet delivery energies are shown in 
Figure [12] Observe that the decision to turn off the relay 
r entirely comes at smaller psd values. This is because r 
consumes energy in listening to incoming packets from s as 
well as sending outgoing packets to d. The energy cost of using 
r is the same as retransmitting twice from s. In addition, since 
r shares the use of the wireless medium with s, having r turned 
on reduces the rate at which packets can be transmitted from 
s. With these two effects combined, r is used only at small 
Psd values. Another result of the energy tradeoff between s 
and r observable from these two figures is that even though 
the optimal packet deliver energy curve is continuous, a* sees 
a jump for each of the coding strategies. 

Similar energy and throughput curves can be evaluated when 
energy parameters Ef^, E^x, E^c and Eack take on different 
ranges. In practical systems, depending on the underlying 
circuit implementation, one or more of these energy terms 
can dominate over the others, and the optimal transmission 



schedule could be very different from the ones shown above. 
Nonetheless, our analysis enables robust decision making to 
determine when and where to code in a wireless packet erasure 
relay channel. 

VI. Conclusion 

We propose Markov chain models to analyze the throughput 
and packet delivery energy performances of network coding 
strategies in the wireless packet erasure relay channel. The 
evolution of innovative packets are tracked when either or 
both the source and the relay perform random linear network 
coding. We show through numerical evaluations that using 
a random code at the relay alone is neither throughout nor 
energy efficient, while coding at the source alone can provide 
a good tradeoff between throughput and energy use. We 
also show that only a very small amount of memory is 
required at the relay when coding is performed at the source 
only. Although we do not attempt to categorize explicitly 
the optimal network coding strategies in the relay channel 
under different system parameters, we provide a framework 
for deciding whether and where to code, taking into account of 
throughput maximization and energy depletion. Future work 
will consider the use of systematic codes, which have been 
mentioned in this paper but not studied in detail. A natural 
extension of the three-node relay channel is a star-shaped 
network, where nodes can act as relays for their neighbors. 
A direct generalization of our given framework does not seem 
tractable, nonetheless it is clear from our short analysis that 
the problem of choosing an optimal coding subgraph is very 
important when practical constraints, such as energy, are taken 
into account. 
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